Generic scale of the "scale-free" growing networks 
We show that the connectivity distributions $P(k,t)$ of scale-free growing networks ($t$ is the
network size) have a generic scale: the cut-off at $k_{\rm cut} \sim t^{\beta}$. The scaling exponent $\beta$ is related to
the exponent $\gamma$ of the connectivity distribution, $\beta = 1/(\gamma-1)$. We propose the simplest model of
scale-free growing networks and obtain the exact form of its connectivity distribution for any size
of the network. We demonstrate that the trace of the initial conditions, a hump at $k_h \sim k_{\rm cut} \sim t^{\beta}$,
may be found for any network size. We also show that there exists a natural boundary for the
observation of scale-free networks and explain why so few scale-free networks are observed in
Nature.
Significant progress has been made recently in the field of
evolving networks [?]. It was observed that a number
of growing networks in Nature (the World-Wide Web, the Internet,
networks of scientific citations, collaboration nets,
some networks in biology, etc.) are scale-free, i.e., their
connectivity distribution is of a power-law form. Moreover,
it was found that at least many of them have to be
scale-free, since otherwise growing networks are not resilient
enough to random breakdowns [?]. The infinite scale-free
network with the connectivity distribution exponent
$\gamma < 3$ does not decay for any concentration (less than
one) of randomly removed links.

The proposed mechanism of self-organization of networks
into scale-free structures, preferential linking,
is quite natural [10]. New links of the growing networks
are preferentially attached to nodes which already have
many connections (connectivity $k$). In fact, it is the
realization of a general principle: popularity is attractive.
Several types of preferential linking were proposed
[?] which provide a variety of values of the exponent $\gamma$
between 2 and infinity.

One should emphasize that only a few scale-free networks
are known so far. The range of values of the connectivity
in which the power-law behavior can be observed
is usually too narrow for a precise measurement
of the exponent $\gamma$. It is unclear why so few scale-free
networks are observed. Why are the values of $\gamma$ for all of
them only between 2 and 3? (Note that not every network
has to be resilient; e.g., neither nodes nor links of
collaboration networks are removable by definition [?].) In
the present Letter, we answer these questions.

In previous papers, the connectivity distributions
$P(k,t)$ of scale-free networks were calculated in the
"thermodynamic limit", i.e., in the limit of large system
size $t$, which also plays the role of time if one node is
added at each increment of time. In this case, the distribution
is stationary and is of the form $P(k) \propto k^{-\gamma}$ over the
whole range of large enough $k$, $k \gg 1$. Nevertheless, real
networks are finite. The evolution of $P(k,t)$ to the stationary
distribution turns out to be nontrivial. We demonstrate
below that, for finite networks, the power-law region of
the connectivity distribution has a cut-off at $k_{\rm cut} \sim t^{\beta}$,
where $\beta = 1/(\gamma-1)$. We show that the trace of the initial
conditions, i.e., of the initial configuration of the network,
namely the hump at $k_h \sim k_{\rm cut} \sim t^{\beta}$, may be observed at any
size of the network.

This cut-off in the connectivity distribution allows
observation of the power-law dependence only for very large
networks. We show that for large values of $\gamma$, the
power-law dependence is practically unobservable.

Two answers have already been given to the question:
Why are the observed scale-free networks such as they
are? First, because their evolution is determined by
the preferential linking mechanism [10]. Second, because
otherwise they would be unstable, weak against
processes of decay, and could not exist as united systems
[?]. Here, we propose a third answer: because otherwise
they would be unobservable, i.e., it would be
impossible to observe a power-law distribution.

We demonstrate these features of the connectivity dis- 
tribution using the simplest model of the scale-free grow- 
ing network for which we present the exact solution - the 
implicit form of the connectivity distribution for all sizes 
of the network. Also, we obtain them from general
considerations. One should note that the introduced model
is of interest in itself, so we briefly present the main exact
results for it.

Let us introduce the simplest model of the scale-free
growing network with undirected links (see Fig. 1).
Initially ($t = 2$), there are three nodes, $s = 0, 1, 2$, each
with the connectivity 2. (The connectivity of a site is the
number of its connections.)

(i) At each increment of time, a new node is added. 

(ii) It is connected to both ends of a randomly chosen 
link by two undirected links. 
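
The two growth rules above can be sketched as a short simulation. This is only an illustrative sketch: the function name `grow_network`, the `seed` parameter, and the list-based storage of links are assumptions made here, not part of the original model description.

```python
import random

def grow_network(t_final, seed=0):
    """Grow the network up to time t_final (t_final >= 2).

    Returns (links, degree): the list of undirected links (u, v) and the
    list of connectivities of the nodes s = 0..t_final.
    """
    rng = random.Random(seed)
    links = [(0, 1), (1, 2), (0, 2)]   # initial triangle at t = 2
    degree = [2, 2, 2]
    for new in range(3, t_final + 1):  # node `new` is added at time t = new
        u, v = rng.choice(links)       # rule (ii): pick a link uniformly at random
        links += [(new, u), (new, v)]  # connect the new node to both of its ends
        degree[u] += 1
        degree[v] += 1
        degree.append(2)               # every new node starts with connectivity 2
    return links, degree

links, degree = grow_network(1000)
# t + 1 nodes and 2t - 1 links are expected at time t
print(len(degree), len(links), max(degree))
```

Choosing a link uniformly at random selects each of its end nodes with a probability proportional to that node's connectivity, so no explicit preferential rule appears in the code.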

As far as we know, it is the simplest model of a
scale-free network. The preferential linking arises in it not
because of some special rule including a function of
connectivity, as in [10], but naturally. Indeed, in the model
that we consider, the probability that a node has the
randomly chosen link attached to it is equal to the
connectivity $k$ of the node divided by the total number of
links, $2t - 1$. Therefore, the evolution of the network is
described by the following master equation:



\[
p(k,s,t+1) = \frac{k-1}{2t-1}\,p(k-1,s,t) + \frac{2t-1-k}{2t-1}\,p(k,s,t),
\tag{1}
\]



with the initial condition $p(k, s \in \{0,1,2\}, t=2) = \delta_{k,2}$.
Also, $p(k,t,t) = \delta_{k,2}$. Here $p(k,s,t)$ is the probability
that the site $s$, $0 \le s \le t$, has $k$ connections at time $t$. Note
that this master equation and all the following ones are
exact for all $t \ge 2$. Eq. (1) has a form similar to that
of the Barabási-Albert model [10]. Therefore, one may
expect that the scaling exponents of these models
coincide.

From Eq. (1), we can obtain a number of useful exact
relations for our model. In particular, from Eq. (1), one
may find the equation for the average connectivity of an
individual node, $\bar{k}(s,t) \equiv \sum_{k \ge 2} k\,p(k,s,t)$:

\[
\bar{k}(s,t+1) = \frac{2t}{2t-1}\,\bar{k}(s,t), \qquad \bar{k}(t,t) = 2.
\tag{2}
\]

One can easily obtain its solution:

\[
\bar{k}(s,t) = 2^{\,t-s+1}\,\frac{(t-1)!}{(s-1)!}\,\frac{(2s-3)!!}{(2t-3)!!}.
\tag{3}
\]

Here, $s \ge 2$ and $\bar{k}(0,t) = \bar{k}(1,t) = \bar{k}(2,t)$. For $s, t \gg 1$,
Eq. (3) gives $\bar{k}(s,t) \cong 2\sqrt{t/s}$. Hence,
the scaling exponent $\beta$, defined through the relation
$\bar{k}(s,t) \propto (s/t)^{-\beta}$, equals 1/2, as for the
Barabási-Albert model.
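
As a quick consistency check, one may iterate the recursion (2) with exact rational arithmetic and compare it with the closed form (3) and with the large-size estimate $2\sqrt{t/s}$. The helper names below (`double_factorial`, `kbar_recursion`, `kbar_closed`) are introduced here for illustration only.

```python
from fractions import Fraction
from math import factorial, sqrt

def double_factorial(n):
    """n!! for odd n >= -1, with (-1)!! = 1!! = 1."""
    result = 1
    while n > 1:
        result *= n
        n -= 2
    return result

def kbar_recursion(s, t):
    """Iterate Eq. (2) from kbar(s, s) = 2 up to time t (for s >= 2)."""
    k = Fraction(2)
    for tau in range(s, t):
        k *= Fraction(2 * tau, 2 * tau - 1)
    return k

def kbar_closed(s, t):
    """Closed form of Eq. (3)."""
    return (Fraction(2 ** (t - s + 1) * factorial(t - 1), factorial(s - 1))
            * Fraction(double_factorial(2 * s - 3), double_factorial(2 * t - 3)))

print(kbar_recursion(5, 40) == kbar_closed(5, 40))        # exact agreement
print(float(kbar_closed(50, 5000)), 2 * sqrt(5000 / 50))  # compare with 2*sqrt(t/s)
```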



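Also, one may find the average number $b(s,s')$ of
links between the sites $s$ and $s'$ for any $s < s' \le t$,
$0 \le b(s,s') \le 1$. In fact, $b(s,s')$ is the average of the
element of the connectivity matrix over all possible
realizations of the growth. The equation for this quantity
is

\[
b(s,s'+1) = \frac{1}{2s'-1}\left[\,\sum_{u=0}^{s-1} b(u,s) + \sum_{u=s+1}^{s'} b(s,u)\right].
\tag{4}
\]

Its exact solution for $2 \le s < s'$ is of the form:

\[
b(s,s') = 2^{\,s'-s}\,\frac{(s'-2)!}{(s-1)!}\,\frac{(2s-3)!!}{(2s'-3)!!}
\;\overset{s,s' \gg 1}{\cong}\; \frac{1}{\sqrt{s s'}},
\tag{5}
\]

and $b(0,s') = b(1,s') = b(2,s')$ for $s' > 2$, while the three links of the
initial triangle are present with certainty, $b(0,1) = b(0,2) = b(1,2) = 1$.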

We found exactly the connectivity distribution of the
oldest nodes, $p(k,0,t) = p(k,1,t) = p(k,2,t)$:

\[
p(k,2,t) = \frac{k-1}{2^{\,t-k}}\,\frac{(2t-k-2)!}{(t-k)!\,(2t-3)!!}
\;\overset{t \gg k}{\cong}\; \frac{k-1}{2t}.
\tag{6}
\]



This relation turns out to be useful for finding the total
connectivity distribution. Also, one may obtain the relation
$p(2,s,t) = (2s-3)/(2t-3)$. The scaling form of $p(k,s,t)$
for $k, s, t \gg 1$ and $k\sqrt{s/t}$ fixed is obtained using the
Z-transform for the connectivity $k$. The scaling relation is
of the form:

\[
p(k,s,t) = \sqrt{\frac{s}{t}}\left(k\sqrt{\frac{s}{t}}\right)\exp\left(-k\sqrt{\frac{s}{t}}\right).
\tag{7}
\]



This is a particular case of the corresponding scaling
relations for the scale-free networks [13], see Eq. (13).
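
The exact single-node results quoted above can be checked by iterating the master equation (1) directly. The sketch below uses exact fractions; the function names `evolve_node` and `p_oldest` are hypothetical helpers introduced here, and the dictionary representation of $p(k,s,t)$ is merely a convenient choice.

```python
from fractions import Fraction
from math import factorial

def double_factorial(n):
    """n!! for odd n >= -1, with (-1)!! = 1!! = 1."""
    result = 1
    while n > 1:
        result *= n
        n -= 2
    return result

def evolve_node(s, t):
    """Return {k: p(k, s, t)} by iterating Eq. (1) from the birth of node s."""
    start = max(s, 2)                  # nodes 0, 1, 2 are present from t = 2
    p = {2: Fraction(1)}               # every node is born with connectivity 2
    for tau in range(start, t):        # one step tau -> tau + 1
        links = 2 * tau - 1            # number of links at time tau
        new_p = {}
        for k, prob in p.items():
            gain = Fraction(k, links)  # probability that the node gets a new link
            new_p[k + 1] = new_p.get(k + 1, 0) + prob * gain
            new_p[k] = new_p.get(k, 0) + prob * (1 - gain)
        p = new_p
    return p

def p_oldest(k, t):
    """Exact part of Eq. (6) for the oldest nodes s = 0, 1, 2."""
    return Fraction((k - 1) * factorial(2 * t - k - 2),
                    2 ** (t - k) * factorial(t - k) * double_factorial(2 * t - 3))

p = evolve_node(2, 12)
print(all(p[k] == p_oldest(k, 12) for k in p))               # Eq. (6)
print(evolve_node(5, 12)[2] == Fraction(2 * 5 - 3, 2 * 12 - 3))  # p(2,s,t)
```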



The matter of interest is the total connectivity distribution, $P(k,t) \equiv \sum_{s=0}^{t} p(k,s,t)/(t+1)$. The equation for it
can be derived from Eq. (1):

\[
P(k,t) = \frac{t}{t+1}\left[\frac{k-1}{2t-3}\,P(k-1,t-1) + \left(1 - \frac{k}{2t-3}\right)P(k,t-1)\right] + \frac{1}{t+1}\,\delta_{k,2},
\tag{8}
\]

with the initial condition $P(k,2) = \delta_{k,2}$.
The exact solution of Eq. (8) is:

\[
P(k,t) = \frac{24}{k(k+1)(k+2)}\,\frac{1}{(t+1)(2t-3)!!}\,\frac{(2t-k-2)!}{2^{\,t-k}\,(t-k)!}
\left\{(t-k)\left[\,t + \frac{(k-2)(k+1)}{4}\right] + \frac{(k-1)k(k+1)(k+2)}{8}\right\}.
\tag{9}
\]



One may check Eq. (9) by inserting it directly into Eq.
(8). We obtained Eq. (9) using the distribution function
$\tilde{P}(k,t) \equiv \sum_{s=3}^{t} p(k,s,t)/(t-2)$, which looks less
cumbersome than $P(k,t)$ and may be found without great
difficulty, and the expression for $p(k,2,t)$, Eq. (6).
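
The check mentioned above can also be done numerically. The following sketch iterates the recursion (8) with exact fractions and compares the result with the closed form (9) for every $k$; the function names `P_recursion` and `P_exact` are assumptions made for this illustration.

```python
from fractions import Fraction
from math import factorial

def double_factorial(n):
    """n!! for odd n >= -1, with (-1)!! = 1!! = 1."""
    result = 1
    while n > 1:
        result *= n
        n -= 2
    return result

def P_recursion(t_final):
    """Iterate Eq. (8) from P(k, 2) = delta_{k,2}; returns {k: P(k, t_final)}."""
    P = {2: Fraction(1)}
    for t in range(3, t_final + 1):
        links = 2 * t - 3                      # number of links at time t - 1
        new_P = {}
        for k in range(2, t + 1):
            prev_km1 = P.get(k - 1, Fraction(0))
            prev_k = P.get(k, Fraction(0))
            value = Fraction(t, t + 1) * (Fraction(k - 1, links) * prev_km1
                                          + (1 - Fraction(k, links)) * prev_k)
            if k == 2:
                value += Fraction(1, t + 1)    # contribution of the new node
            new_P[k] = value
        P = new_P
    return P

def P_exact(k, t):
    """Closed form of Eq. (9)."""
    prefactor = Fraction(24, k * (k + 1) * (k + 2) * (t + 1)
                         * double_factorial(2 * t - 3))
    middle = Fraction(factorial(2 * t - k - 2), 2 ** (t - k) * factorial(t - k))
    bracket = ((t - k) * (t + Fraction((k - 2) * (k + 1), 4))
               + Fraction((k - 1) * k * (k + 1) * (k + 2), 8))
    return prefactor * middle * bracket

P = P_recursion(30)
print(all(P[k] == P_exact(k, 30) for k in P), sum(P.values()) == 1)
```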
From Eq. (8) with $t \to \infty$, the equation for
the stationary distribution, $P(k)$, follows:

\[
(k-1)P(k-1) - (k+2)P(k) + 2\delta_{k,2} = 0,
\tag{10}
\]

whose solution is:

\[
P(k) = \frac{12}{k(k+1)(k+2)}.
\tag{11}
\]

Eq. (11) is similar to the form of the stationary connectivity
distribution found for the Barabási-Albert model
[?]. One sees that $\gamma = 3$.
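
For completeness, the step from the stationary equation (10) to its solution (11) can be written out explicitly. This short worked derivation is added here as an illustration and uses only Eq. (10) together with $P(1) = 0$ (every node has at least two connections):

```latex
% k = 2 in Eq. (10), with P(1) = 0:
-4P(2) + 2 = 0 \quad\Longrightarrow\quad P(2) = \tfrac{1}{2}.
% k > 2 in Eq. (10):
P(k) = \frac{k-1}{k+2}\,P(k-1)
\quad\Longrightarrow\quad
P(k) = P(2)\prod_{j=3}^{k}\frac{j-1}{j+2}
     = \frac{1}{2}\,\frac{(k-1)!\;4!}{(k+2)!}
     = \frac{12}{k(k+1)(k+2)} .
% For k >> 1 this decays as 12 k^{-3}, i.e. with the exponent gamma = 3.
```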

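Our aim is to find how the stationary distribution is
reached. From Eq. (9), for $t \gg k \gg 1$, one gets

\[
P(k,t) \cong P(k)\left[1 + \frac{1}{4}\frac{k^2}{t} + \frac{1}{8}\left(\frac{k^2}{t}\right)^{2}\right]\exp\left(-\frac{1}{4}\frac{k^2}{t}\right).
\tag{12}
\]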



The factor $P(k,t)/P(k) \equiv g(k/\sqrt{t})$ depends only on
the combination $k/\sqrt{t}$. Therefore, the peculiarities of
the distribution induced by the size effects never disappear
but only move with increasing time in the direction
of large connectivity. The function $g(k/\sqrt{t})$ is close
to 1 for $k < \sqrt{t}$, has a hump between $\sqrt{t}$ and
$4\sqrt{t}$ with a maximum at $k_{\max}/\sqrt{t} = \sqrt{6} = 2.449\ldots$,
$g(k_{\max}/\sqrt{t}) = 7e^{-3/2} = 1.562\ldots$, and the cut-off at
$k \sim 4\sqrt{t}$ (see Fig. 2). Hence, the power-law behavior is
observable only in a rather narrow region, $1 \ll k \ll \sqrt{t}$.
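
The position and height of the hump follow from the bracket and the exponential in Eq. (12). A minimal numerical sketch, assuming the scaling variable $x = k/\sqrt{t}$ and a simple grid search (the grid step is an arbitrary choice made here):

```python
from math import exp, sqrt

def g(x):
    """Finite-size factor P(k,t)/P(k) of Eq. (12) as a function of x = k/sqrt(t)."""
    y = x * x
    return (1 + y / 4 + y * y / 8) * exp(-y / 4)

# Locate the maximum of g on a fine grid of x values in (0, 8].
xs = [i / 1000 for i in range(1, 8001)]
x_max = max(xs, key=g)

print(x_max, sqrt(6))               # position of the hump
print(g(sqrt(6)), 7 * exp(-1.5))    # height of the hump
```

Analytically, with $y = x^2$, $dg/dy = e^{-y/4}\,(3y/16 - y^2/32)$, which vanishes at $y = 6$, in agreement with the values quoted in the text.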

One may check that the form of the hump in Fig. 2
depends on the initial conditions. In our case, the evolution
starts from the configuration shown in Fig. 1(a). If
we start the growth from another configuration, the form
would be different.

We have demonstrated above the size-dependence of
the connectivity distribution using the exactly solvable
example. What are the general reasons for such behavior
of the scale-free networks? Let us obtain a general
estimate of the distribution cut-off position for an
arbitrary scale-free network.

Measurement of connectivity distributions is always
impeded by strong fluctuations at large $k$. The reason
for such fluctuations is the poor statistics in this
region. One can easily estimate the characteristic value,
$k_f$, above which the fluctuations are strong. Let $n$ be
the total number of links of the network, and $\gamma > 2$.
For the linearly growing network, $n = mt$, where $m$ is
the number of links added at each increment of time. If
$P(k) \sim k^{-\gamma}$, then $n k_f^{-\gamma} \sim 1$. Therefore, $k_f \sim n^{1/\gamma}$. One may
improve the situation using the cumulative distribution,
$\int_k^{\infty} dk\,P(k)$, instead of $P(k)$. Also, in simulations, one
may make a lot of runs to improve the statistics. Nevertheless,
one cannot pass the cut-off, $k_{\rm cut}$, that we discuss.
This cut-off is the real barrier for the observation of the
power-law dependence.

We have shown that the connectivity distribution of
individual sites is an exponentially decreasing function
at large $k$ (see Eq. (7)). For the scale-free networks, it
can be written in the general scaling form [13]:

\[
p(k,s,t) = \left(\frac{s}{t}\right)^{\beta} f\!\left(k\left(\frac{s}{t}\right)^{\beta}\right),
\tag{13}
\]

where $f(x)$ is a scaling function, and the relation [12,13]
between the exponents $\beta$ and $\gamma$ is

\[
\beta(\gamma - 1) = 1.
\tag{14}
\]



In the particular case of the proposed model, $f(x) =
x\exp(-x)$. The exponent $\beta$ also figures in the relation
for the average connectivity, $\bar{k}(s,t) \propto (s/t)^{-\beta}$. It follows
from Eq. (13) that the cut-off of the total distribution
is determined by the connectivity distribution of
the individual nodes with the smallest number $s$, i.e.,
by the oldest ones. Therefore, $k_{\rm cut}(1/t)^{\beta} \sim {\rm const}$ and
$k_{\rm cut} \sim t^{1/(\gamma-1)}$. For the considered model, $\beta = 1/2$,
see Eq. (7). The connectivity distributions of the oldest
nodes (and the number of them) depend strongly on
the initial conditions. Hence, the part of the total
connectivity distribution near the cut-off depends strongly
on this factor. Now it becomes obvious why there are
no scale-free networks with large values of $\gamma$. Indeed,
the power-law dependence of the distribution can be
observed only if it exists for at least 2 or 3 decades of the
connectivity. For this, the networks have to be large,
$t > 10^{2.5(\gamma-1)}$. But there are only a few large networks in
Nature! If $\gamma > 3$, one has practically no chance to find
the scale-free behavior.
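
To make the size requirement concrete, the estimate $t > 10^{2.5(\gamma-1)}$ can be tabulated for a few exponent values. The sketch below assumes that "2 or 3 decades" is represented by 2.5 decades, as in the estimate above; the function name `min_size` and the sample values of $\gamma$ are illustrative choices.

```python
def min_size(gamma, decades=2.5):
    """Network size t at which k_cut ~ t**(1/(gamma - 1)) reaches 10**decades."""
    return 10 ** (decades * (gamma - 1))

for gamma in (2.1, 2.5, 3.0, 4.0):
    # The required size grows exponentially with gamma.
    print(f"gamma = {gamma}: t > {min_size(gamma):.1e}")
```

For $\gamma = 3$ the required size is $10^5$ nodes, and for $\gamma = 4$ it already exceeds $10^7$, which illustrates why networks with large $\gamma$ are hard to identify as scale-free.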

In Fig. 3, in a log-linear scale, we present the sizes of
all known scale-free networks vs the values of their exponent $\gamma$.
The plotted points are inside the region restricted by
the lines $\gamma = 2$ and $\log_{10} t \sim 2.5(\gamma-1)$, and by the logarithm
of the size of the largest scale-free network in Nature,
the World-Wide Web, $\log_{10} t \sim 9$.

We have demonstrated that the form of the connectivity
distribution is influenced by initial conditions even for
large networks. Therefore, it is hard to obtain the values
of the scaling exponents with high precision both from
experimental data and simulations. One should note that
including the aging of nodes, breaking of links, or
disappearance of nodes suppresses the effect of the initial
conditions and removes the hump (see the plots of the
connectivity distributions in [?]).

In conclusion, we have described the size effect on the
connectivity distribution of the scale-free growing networks.
We have shown that the scale-free networks have a
generic scale: the size-dependent cut-off, $k_{\rm cut} \sim t^{1/(\gamma-1)}$,
of the connectivity distribution. This cut-off impedes
observations of the power-law dependence even for large
networks. For large $\gamma$, such observations are impossible.
If $\gamma \to 2$, then $k_{\rm cut} \sim t$, so, in fact, the cut-off is absent.
We have estimated the region of the network sizes and
the values of the exponent $\gamma$ in which the power law is
visible. All scale-free networks found so far are in this region.
We have shown that the trace of the initial configuration
of the network, the hump near $k_{\rm cut}$, may be observed
for all sizes of the network. We have demonstrated such
behavior using the simplest model of a scale-free growing
network. It turned out to be possible to find its exact
solution for any size of the network. Also, these results
have been obtained from general considerations. In
fact, they are general and applicable to systems displaying
power-law distributions.

The proposed model belongs to the class of the exactly
solvable scale-free growing networks. One can consider
another simple model. Instead of connecting a
new node with the ends of a randomly chosen link of the
network, one may connect it each time with all three
vertices of a randomly chosen triangle of links. (Note
that we forbid multiple links.) Such a model has the
same scaling exponents as the one considered.

It follows from our results that one cannot see
scale-free networks with large $\gamma$. Also, if we do not
observe a scale-free connectivity distribution for some
growing network, this does not mean at all that it is not
a scale-free one. There is a chance that the power-law
behavior will be found after some time, when the
network has grown.

[14] P.L. Krapivsky, S. Redner, and F. Leyvraz, cond-mat/0005139, to appear in Phys. Rev. Lett.

FIG. 1. Illustration of the simplest model of scale-free 
growing networks. In the initial configuration, t = 2, three
sites are present, s = 0, 1, 2 (a). At each increment of time, a 
new node with two links is added. These links are attached 
to the ends of a randomly chosen link of the network. 
FIG. 2. Deviation of the connectivity distribution
of the finite-size network from the stationary one,
$P(k,t)/P(k,t \to \infty)$, vs $k/\sqrt{t}$. The form of the hump
depends on the initial configuration.




FIG. 3. Log-linear plot of the size vs the value of the exponent
$\gamma$ for all the observed scale-free networks. The line
$\log_{10} t \sim 2.5(\gamma-1)$ is the finite-size boundary for the observation
of the power-law connectivity distributions. The dashed
line, $\gamma = 3$, is the resilience boundary. This boundary is
important for those growing networks which have to be stable to
random breakdowns. The points: 1a and 1b are obtained for
incoming and outgoing links of the pages of the World-Wide
Web [?] (also, $\gamma_{\rm in} = 2.1$ and $\gamma_{\rm out} = 2.45$ were obtained from
the complete map of the nd.edu domain of the Web, 325,729
nodes [?]; $\gamma_{\rm in} \sim 1.94$ was obtained for the domain level of
the Web in spring 1997 [?]); 2a is for outgoing links for the
inter-domain structure of the Internet and 2b is for outgoing
links for the Internet at the router level; 3a and 3b are
for citations of the ISI data base and Phys. Rev. D; 4
is for the collaboration network of MEDLINE [?]; 5 is for
the collaboration network of movie actors [?] (also, $\gamma = 2.3$
was obtained for this network in [?]); 6 is for incoming and
outgoing links of the networks of the metabolic reactions.
The precision of the upper points is about ±0.05 and is much
worse for points in the dashed region.